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Abstract 

Background: WD40 domains have been found in a plethora of eukaryotic proteins, acting as scaffolding molecules 
assisting proper activity of other proteins, and are involved in multi-cellular processes. They comprise several 
stretches of 44-60 amino acid residues often terminating with a WD di-peptide. They act as a site of protein-protein 
interactions or multi-interacting platforms, driving the assembly of protein complexes or as mediators of transient 
interplay among other proteins. In Arabidopsis, members of WD40 protein superfamily are known as key regulators 
of plant-specific events, biologically playing important roles in development and also during stress signaling. 

Results: Using reverse genetic and protein modeling approaches, we characterize GIGANTUS1 (GTS1), a new 
member of WD40 repeat protein in Arabidopsis thaliana and provide evidence of its role in controlling plant 
growth development. GTS1 is highly expressed during embryo development and negatively regulates seed 
germination, biomass yield and growth improvement in plants. Structural modeling analysis suggests that 
GTS1 folds into a (3-propeller with seven pseudo symmetrically arranged blades around a central axis. Molecular 
docking analysis shows that GTS1 physically interacts with two ribosomal protein partners, a component of 
ribosome Nop16, and a ribosome-biogenesis factor L19e through (3-propeller blade 4 to regulate cell growth 
development. 

Conclusions: Our results indicate that GTS1 might function in plant developmental processes by regulating 
ribosomal structural features, activities and biogenesis in plant cells. Our results suggest that GIGANTUS1 
might be a promising target to engineer transgenic plants with higher biomass and improved growth 
development for plant-based bioenergy production. 

Keywords: Arabidopsis, Gigantusl, Gene expression, Homology modeling, Docking 



* Correspondence: simeon.kotchoni@rutgers.edu 
+ Equal contributors 

department of Biology, Rutgers University, 315 Penn St., Camden, NJ 08102, 
USA 

2 Center for Computational and Integrative Biology (CCIB), Rutgers University, 
315 Penn St., Camden, NJ 08102, USA 

Full list of author information is available at the end of the article 

O© 2014 Gachomo et al.; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative 
BiolVlGCl C6ntFcll Commons Attribution License (http://creativecommons.Org/licenses/by/2.0), which permits unrestricted use, distribution, and 
reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication 
waiver (http://creativecommons.Org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise 
stated. 



Gachomo et al. BMC Plant Biology 2014, 14:37 
http://www.biomedcentral.com/1471-2229/14/37 



Page 2 of 1 7 



Background 

In plants, growth development, cell patterning, yield, 
and biomass accumulation are controlled by functional 
genetic networks of several orders of magnitude that will 
require various complementary research approaches to 
uncover the basis of this functional genetic complexity. 
Using a combination of reverse genetic and computa- 
tional protein modeling, we identified a protein, termed 
GIGANTUS1 (GTS1), a member of transducin/WD40 
protein superfamily that regulates growth development 
in plants. 

Transducin/WD40 repeat proteins are prominent fea- 
tures within proteins that mediate diverse protein-protein 
interactions, including those involved in scaffolding 
and the cooperative assembly and regulation of dynamic 
multi-subunit complexes [1,2]. The common and defined 
feature of these proteins are short -40 amino acid motifs, 
typically ending by Trp-Asp sequence, and usually com- 
posed of a 7-8 bladed beta-propeller fold. However, a di- 
versity of proteins has been found with 4 to 16 repeated 
units [3]. The low level of sequence conservation and 
functional diversity of WD40 domains make their identi- 
fication difficult. 

Repeated WD40 domains play central roles in bio- 
logical processes such as cell division and cytokinesis, 
apoptosis, light signaling and vision, cell motility, flower- 
ing, floral development, meristem organization, protein 
trafficking, cytoskeleton dynamics, chemotaxis, nuclear 
export to RNA processing, chromatin modification, and 
transcriptional mechanism [4]. They act as a site for 
protein-protein interaction, where the specificity of the 
proteins is determined by the sequences outside the re- 
peats themselves. Their functional importance resides 
largely on the protein surfaces. They serve as multi- 
interacting platforms in cellular networks for the assembly 
of protein complexes or mediators of transient interplay 
among other proteins [5]. Structural studies suggest that 
this property stems from their ability to interact with di- 
verse proteins, peptides or nucleic acids using multiple sur- 
faces, where the most common peptide interaction site of 
WD40 proteins is located on the top surface of the propel- 
ler close to the central channel [6]. 

Although WD40 proteins also are present in bacteria, 
e.g. Thermomonospora curvata [7] and Cyanobacterium 
synechocystis [8], WD40 domains are among the ten 
most abundant domain types across eukaryotic proteomes, 
and interactome studies suggest that they are among the 
most promiscuous interactors. Despite several WD40- 
containing proteins acting as key regulators of plant- 
specific developmental events, WD40 domains have been 
given less research attention compared to other common 
domains, for example kinase domains. Up to now, there 
have been no comprehensive 3D structural analyses of 
WD40 protein revealing the interacting partners and 



highlighting the relevance of WD40 domains mediating 
different biological functions [9]. Furthermore, in con- 
trast to other members of the p-propeller family, and 
despite being crucial for and residing in enzymatic com- 
plexes, no WD40 protein has been reported to possess 
catalytic activity. 

In order to fully understand the molecular- and 
structural-based regulatory mechanisms of GTS1 and 
the transducin/WD40 protein family, we carried out a 
comprehensive expression profiling, a mutational-based 
phenotypic characterization and a structural-based pro- 
tein modeling of functional features of GIGANTUS1 in 
Arabidopsis. This is the first study in which the inter- 
acting partners of Arabidosis GTS1 protein have been 
identified. A comprehensive molecular and structural 
analysis based on protein homology modeling, and a 
docking interaction study to elucidate the functional 
mechanisms of GTS1 protein in regulating growth de- 
velopment in Arabidopsis thaliana were carried out. 
We described the phenotypic characterization of a gtsl 
knockout mutant during growth development and assessed 
the GTS 1 -3D molecular structure and its docking features 
to uncover the regulatory relationship of GTS1 with other 
proteins. 

Methods 

Plant material and GTS1 expression profiling analysis 

Arabidopsis thaliana (ecotype Col-0) and gtsl knock- 
out mutant (T-DNA SALK_010647) from Arabidopsis 
Biological Research Center (ABRC) were used throughout 
this work. Appropriate seeds were sown on Murashige 
and Skoog (1 x MS) agar plates or soil and seedlings were 
allowed to grow under continuous illumination (120-150 
uEm -2 s _1 ) at 24° C. Seedling samples were collected at 
different developmental stages for gene expression profil- 
ing. To analyze the expression of GTS1 gene, total RNA 
was extracted with TRIzol reagent (Molecular Research 
Center) and then reversed transcribed using qScript 
cDNA Supermix (Quanta Biosciences, Gaithersburg, MD, 
USA) as previously described [10]. Thereafter, the cDNA 
was used as the template for PCR using gene-specific 
primers (Table 1), running 20 or 25 amplification cycles 
(linear range of amplification) unless otherwise noted [11]. 
The linear range of amplification was determined by run- 
ning increasing cycle numbers and analyzing the amount 
of cDNA fragments. PCR fragments were separated on 1% 
agarose gels containing ethidium bromide. A cDNA 
fragment generated from ACTIN (AT3G 18780) served 
as an internal control. T-DNA insertion in GIGANTUS1 
gene was PCR-confirmed using GIGANTUS1 gene specific 
primers and T-DNA left border (LB) primer (Table 1). The 
expression of GTS1 gene in gtsl mutant background was 
analyzed by extracting total RNA from the gtsl homozy- 
gous line as above described and reverse transcribed into 
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Table 1 Sequences of oligonucleotide primers used in this study 



Name 


Primer sequence 


Description 


GTS1-F1 


5'GAGGAGCTGCAGGG™m3' 


For RT-PCR 


GTS1-R1 


5'CAAGACGGG™ATCTGGGTAG3' 


For RT-PCR 


TDNA-LB 


5'CCGTCTCACTGGTGAAAAGAA3' 


For TDNA insertion 


GTS1-F2 


5'CTGAAACGGCAAATGGAAGAAG3' 


For complementation test 


GTS1-R2 


5'CTATGTCCTGGAAGTCGGAT3' 


For complementation test 


Act2-F 


5'GCGGATCCATGGCTGAGGCTGATGATATTCAACC3' 


For RT-PCR 


Act2-R 


5'CGTCTAGACCATGGAACA^CTGTGAACGATOC3' 


For RT-PCR 



cDNA as previously described [10], and further used in ex- 
pression analysis. 

GTS1 database search and phylogenetic analysis 

The Arabidopsis thaliana GTS1 cDNA sequence (AT 
2G47790) was obtained from the Arabidopsis-TAIR 
website and used to perform a nucleotide BLAST 
(BLASTn) search of the Oryza sativa (rice) genomic se- 
quence on the SALK Institute RiceGE2 web interface. The 
rice sequence identified as having the highest degree of 
homology to the Arabidopsis cDNA was downloaded and 
translated. The translated rice sequence was then aligned to 
the Arabidopsis protein sequence to validate the identifica- 
tion of the gene. The rice cDNA sequences were then used 
to perform a (BLASTn) search against the sequenced Zea 
mays genome on the MaizeSequence.org website. BLAST 
searches against the maize genome produced a list of BAC 
sequences that aligned to the query sequence. The BAC 
with the highest level of similarity was indicated on a gen- 
ome map. The complete BAC sequence was downloaded 
and aligned to the Oryza sativa cDNA sequence using the 
NCBI BLAST (bl2seq) algorithm. Once the putative exons 
had been identified for a specific gene homolog, the exon 
start and end positions were manually corrected based on 
the canonical splice site donor/acceptor sequences and the 
overlapping sequence from one putative exon to the next. 
Each putative maize cDNA was then translated into a pro- 
tein sequence. The GTS1 protein sequences from both 
Oryza sativa and Zea mays were aligned to the Arabidopsis 
thaliana sequence using the EMBOSS global pairwise 
alignment algorithm [12] to get the percent identity be- 
tween the proteins. Next a BLASTp search of the Homo 
sapiens proteome in the NCBI database was performed 
using the Arabidopsis thaliana GTS1 protein sequence. 
From this search the Wdr89 sequence of Homo sapiens was 
identified as the most homologous protein. The Wdr89 se- 
quence was used to retrieve (BLASTp) the Rattus norvegi- 
cus and the Mus musculis sequence homologs. Finally, a 
ClustalW2 alignment [13] was performed aligning all of the 
GTS1 protein sequences from Arabidopsis thaliana, Oryza 
sativa, and Zea mays with the Wdr89 protein sequences 
from Homo sapiens, Rattus norvegicus, and Mus musculis. 



The conserved WD40 protein sequence from Arabi- 
dopsis thaliana was identified using ScanProsite and the 
Swiss-Prot/TrEMBLE databases. The identified WD40 
sequence was used to perform a BLASTp search against 
plant proteomes on NCBI. From the list of potential 
WD40 containing proteins, the cDNA sequence of several 
GTS1 -like proteins were downloaded from Nicotiana 
benthamiana, Nicotiana tabacum, Pisum sativum, Pha- 
seolus vulgaris, Medicago truncatula, Gossypium hirsutum, 
Lycopersicon esculentum, Solanum chacoense, Solanum 
lycopersicum, and Physcomitrella patens. The collected 
plant WD40 containing cDNA sequences were combined 
with the Arabidopsis thaliana, Oryza sativa, and Zea mays 
cDNA sequences previously obtained and aligned using 
RevTrans version 1.4 [14]. Using MrAIC [15-20] model 
test software the best AIC model to use in constructing 
phylogenetic relationships was determined to be the 
General Transition Rate (GTR) with Gamma. 

Phylogenetic trees were generated in MrBayes using 
four Markov Chain Monte Carlo runs with three cold and 
one hot chain each for 1,000,000 generations sampling 
every 100 generations. The burnin was determined to be 
within the first 4000 generations for each phylogeny. The 
first 10000 generations sampled were removed and a 50% 
consensus majority tree was constructed from the 
remaining trees. Trees were then drawn using TreeView 
version 1.6.6 and rooted using the Physcomitrella patens 
sequence as an out group. 

Sequences interaction database search 

Arabidopsis thaliana GTS1, a Transducin/WD40 repeat 
protein (NCBI accession number AEC10888) was used as 
query to retrieve WD40 protein sequences from Uniprot 
(www.uniprot.org), and NCBI (www.ncbi.nlm.nih.gov) data- 
bases using BLAST tools (blast.ncbi.nlm.nih.gov/Blast.cgi). 

GTS1 protein interaction network was obtained using 
the STRING v9.0 (string-db.org) database. STRING out- 
come gave the two most possible interacting protein 
counterparts for GTS1 protein, a 60S ribosome struc- 
tural protein L19e (NCBI accession number AEE75864), 
and Nopl6 (NCBI accession number AAP21378), a pro- 
tein involved in 60S subunit ribosomal biogenesis. 



Gachomo et al. BMC Plant Biology 2014, 14:37 
http://www.biomedcentral.com/1471-2229/14/37 



Page 4 of 1 7 



Functional domains analyses 

GTS1 protein functional domains were studied by query- 
ing different structure-functional motifs and/or pat- 
terns databases such as Pfam v25.0 (pfam.sanger.ac.uk), 
Prosite (prosite.expasy.org/scanprosite), SMART v6.0 
(smart.embl-heidelberg.de), Conserved Domain Database 
(CDD) v3.02, CD ART (Conserved Domain Architecture 
Retrieval Tool) and CD-Search tools (www.ncbi.nlm.nih. 
gov/Structure/cdd/cdd.shtml), InterPRO v35.0 (www.ebi. 
ac.uk/interpro), ProDom release 2010.1 (prodom.prabi. 
fr/prodom/current/html/home.php), CATH v3.4 (www. 
cathdb.info), Superfamily vl.75 (supfam.cs.bris.ac.uk/ 
SUPERFAMILY), PIRSF (pir.georgetown.edu), and func- 
tional searched by PANTHER (www.pantherdb.org). Simi- 
lar analysis was performed for both interacting ribosomal 
protein counterparts, Nop 16 and L19e. 

Secondary structure prediction 

Secondary structural elements of the GTS1 protein were 
initially assessed for substructure conserved motifs by 
threading the sequences through the Protein Data Bank 
(PDB) (www.pdb.org) library using threading algo- 
rithm the Segmer [21]. These elements of the secondary 
structure were also confirmed by comparison with the 
results obtained with other additional 2-D structure 
servers: SSpro8 (Scratch Protein Predictor, scratch. 
proteomics.ics.uci.edu), NetSurfP ver. 1.1 (www.cbs.dtu.dk), 
and PSIPRED (bioinf.cs.ucl.ac.uk/psipred) fold servers. 
These secondary structure predictions were also per- 
formed for both interacting ribosomal protein counter- 
parts, Nopl6 and L19e. 

Structural templates searching 

Protein sequences of GTS1, Nopl6 and L19e were 
searched for homology in the Protein Data Bank (PDB). 
Homologous templates suitable for these three pro- 
teins were selected by BLASTp from the BLAST server 
(ncbi.nlm.nih.gov). The BioInfoBank Metaserver (meta. 
bioinfo.pl), which employs fold recognition for homology 
search, was also used for the selection of templates. The re- 
sults obtained by previous methods were compared with 
the results obtained by Swiss-model server for template 
identification (swissmodel.expasy.org) . 

Homology modeling 

Homology modeling was performed by the I-TASSER 
server [22]. An initial structural model was generated 
and checked for recognition of errors in 3D structures 
using ProSA (prosa.services.came.sbg.ac.at/prosa.php), and 
for a first overall quality estimation of the model with 
QMEAN (swissmodel.expasy.org/qmean/cgi/index.cgi). 

Energy minimization of the final structures was per- 
formed using GROMOS96 force field energy implemented 
in DeepView/Swiss-PDBViewer v3.7 (spdbv.vital-it.ch) in 



order to improve the van der Waals contacts and correct 
the stereochemistry of the model. 

Each structure was assessed using the following soft- 
wares: QMEAN for quality, PROCHECK (www.ebi.ac. 
uk/thornton-srv/software/PROCHECK) for stereological 
corrections, ProSA, and ANOLEA (protein.bio.puc.cl/ 
cardex/servers/anolea) for protein energy. The number 
of protein residues in the favored regions for each structure 
were calculated and visualized by The Ramachandran plot. 

Ligand-binding domains and conservational analysis 

Ligand-binding sites in the 3D protein structures were 
analyzed using Cofactor software (zhanglab.ccmb.med. 
umich.edu/COFACTOR), to identify functional homology. 
Gene Ontology (GO) terms (The Gene Ontology project) 
were used to identify functional analogs based on the 3D 
built models, indicating the possible functions and bio- 
logical pathway in which the proteins might be involved 
(www.geneontology.org). 

Conservational analyses of the proteins were made by 
generating evolutionary related conservation scores using 
ConSurf server (consurf.tau.ac.il). Structural function con- 
servation and key residues in the query proteins were iden- 
tified by ConSeq server (conseq.tau.ac.il). 

Surface electrostatic potential analysis 

The electrostatic Poisson-Boltzmann (PB) potentials 
for the surface amino acids of the structures were ob- 
tained using APBS software implemented in PyMol 0.99 
(www.pymol.org) with AMBER99, and optimized with the 
Python software package PDB2PQR. Fine grid spaces of 
0.35 A were used to solve the linearized PB equation in se- 
quential focusing multigrid calculations in a mesh of 130 
points per dimension at 310.00 K. The dielectric constants 
were two for the proteins and 80.00 for water. The output 
mesh was processed in the scalar OpenDX format to ren- 
der the isocontours and maps on the surfaces with PyMOL 
0.99. Potential values are given in units of kT per unit 
charge (k Boltzmanns constant; T temperature). 

Molecuolar docking analysis 

The analysis of interactions between GTS1 with each 
of its ribosomal counterparts (Nopl6 and L19e,) was per- 
formed using CLUSpro server [23]. During the workflow, 
backbone flexibility analysis was done using rigid-body en- 
semble docking with multiple structures derived from 
NMR, while the ZDOCK option for sampling at 6-degree 
rotational steps was used to obtain the decoys. The ener- 
gies of docked conformations from protein-protein dock- 
ing were evaluated by applying the Fast Fourier Transform 
(FFT) correlation approach.Docking scores were calculated 
by considering shape complementarity, desolvation, and 
electrostatics potential. The top docked conformations, 
along with their ZDOCK scores, were used as candidates 
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of near-native structures, and for clustering of binding sites 
where the ligand was within 10 A of its receptor. 

After clustering, the ranked complexes were subjected 
to van der Waals minimization using CHARMM, and 
the protein-inhibitor structure with the best score was 
chosen as the most fitting model for GTSl-ribosomal 
interacting proteins. 

Results 

Molecular and structural analysis of GTS1 
Expression profiling and phylogenetic analysis of GTS1 

The Arabidopsis GTS1 is highly expressed during seed 
germination and particularly accumulating in embryo, 
ovule, and endosperm, (Figure 1A, B). It is abundantly 
expressed in meristemic regions, indicating its crucial 
role in regulating cell divisions (Figure 1A). The strong 
tissue (abscission zones) specific expression pattern of 
GTS1 (Figure 1A) suggests its regulatory implication in 
plant growth developmental process (Figure 1A, B). 

We confirmed that GTS1 transcript accumulates in 
several major organs, including developing flowers, ger- 
minated seeds, young rosette leaves (Figure IB). Micro- 
array data analysis at different developmental stages also 
reveals an overlapping expression pattern of cell division/ 
growth induced genes with GTS1 (Additional file 1: 
Figure SI) as highlighted in Table 2 [24]. These genes are 
involved in transcriptional and posttranscriptional pro- 
cesses, and various biochemical pathways (Table 2). On 
the basis of the high level of amino acid identity between 
Arabidopsis GTS1 and other well characterized WD40 
protein homologs (Additional file 2: Figure S2), a phylo- 
genetic analysis showed that Arabidopsis GTS1 is clus- 
tered with the rice and maize GTS1 (Figure 1C), thus 
indicating that they are more similar to one another 
than they are to other GTS1-WD40 repeat sequences 
(Figure 1C). This cluster belongs to a subclade of the dom- 
inant clade (Figure 1C, gray branches) containing most of 
the plant GTS1 protein homologs. 

GTS1 regulates seed germination and plant growth 
development 

In order to examine the role of GTS1 in plant growth 
development, we employed a reverse genetic approach 
using a SALK_TDNA knockout insertion (Salk_010647) 
of GTS1 gene (Figure 2A) to investigate the effect of loss 
of GTS1 function in gtsl mutant. The SALK_010647 
(gtsl) line harbored a T-DNA insertion in the first exon of 
GTS1 gene (Figure 2A), which was PCR-confirmed by 
using the T-DNA-specific oligonucleotide primer LB1 and 
the GTS1 -specific primers (Table 1). We next examined 
and confirmed the knockout GTS1 mRNA transcript levels 
in gtsl compared to WT using RT-PCR (Figure 2B). When 
compared to the wild type, the homozygous gtsl mutant 
(n = 16) displayed a faster germination rate (Figure 2C-F), 



and a faster growth rate and higher biomass accumulation 
than the wild type (n = 16) (Figure 2G, H), indicating that 
GTS1 negatively regulates cell division, growth and overall 
biomass accumulation in meristemic regions. Furthermore, 
gtsl mutant (n = 22) flowers earlier (5 days earlier) than 
the WT (n = 16) (Figure 3A), as demonstrated by a reduced 
number of rosette leaves (9 ± 0.6, n = 15) compared to the 
wild type (15 ± 0.5, n = 20) at bolting time (Figure 3B). The 
mutant (gtsl) grows significantly taller than WT at the 
same day post germination (Figure 3A). 

In order to confirm that GTS1 is indeed responsible 
for these phenotypes, we performed a complementation 
test by RT-PCR amplifying a 1095 bp of GTS1 -encoding 
sequence from WT cDNA (Table 1), cloned it into the 
Smal site of the pROK2 vector [25] in front of CaMV 
35S promoter-driven overexpression [26] and stably trans- 
formed gtsl mutant background by the floral dip method 
[27]. As expected, the overexpression of GTS1 in gtsl mu- 
tant background was sufficient to abolishes the above de- 
scribed gtsl phenotypes. The complemented line displayed 
a WT-like phenotype, indicating that this GTS1 is indeed 
responsible for the phenotypic characterization in gtsl mu- 
tant. Unlike the other co-expression gene patterns (Table 2), 
we interestingly identified a strong gene-to-gene functional 
relationship between GTS1 (AT2G47790) and the riboso- 
mal protein L19e (At3gl6780) (Figure 4). This data shows 
the detail of stability of co-expression between the 2 
genes. The co-expression was supported by many fac- 
tors, where the PCA correlation remained unchanged 
regardless of the growth parameter/factor considered. 
Both axes are relative gene expression values in base-2 
logarithm against the averaged expression levels of each 
gene (Figure 4). These data argue for a strong gene-to- 
gene functional gene expression, suggesting that these two 
genes/proteins may physically interact to regulate/control 
biological processes in plants. 

Searching for structural templates 

Since we confirmed that GTS1 regulates seed germination 
and growth development (biomass yield, and flowering 
time) in plants (Figures 2 and 3), we next examined the 
protein structure scaffold and interacting partners of GTS 1 
in accomplishing its functions. In order to study the phys- 
ical interactions of GTS 1 with other proteins in regulating 
growth development in plants, we performed a Protein 
Data Bank (PDB) search for GTS1 protein with known ter- 
tiary structure in PDB. The search yielded the crystal struc- 
tures/PDB accession numbers 2h9l, 3iz6, 3ow8, 2gnq, and 
ltbg, showing the highest sequence identity (22, 17, 16, 
24, and 15%, respectively). The suitability of selected 
model was checked by BioInfoBank Metaserver, which 
returned 3D Jury scores (J-score) of 208.4 (2h9l), 200.2 
(3iz6), 198.5 (3ow8), 205.7 (2gnq), and 201.9 (ltbg) for 
GTS1, respectively. In order to confirm the best possible 
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Figure 1 (See legend on next page.) 
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(See figure on previous page.) 

Figure 1 Tissue specific expression profile and Phylogenetic analysis of GTS1. A). GENEVESTIGATOR-Microarray data showing highly expressed 

GTS1 gene in embryo, root apical meristem, root tip, abscission zone and shoot apex [24]. B) Experimental expression analysis of GTS1 showing increased 

transcript accumulation in germinated seed, young and developed rosette leaves and developed flower. C). Phylogenetic relationships between plant 

genes containing WD40 repeat domains. The GTS1 genes of Arabidopsis thaliana, Oryza sativa, and lea mays (shown in red), belong to a subclade of the 

dominate clade (gray branches) containing most of the plant genes listed. Genebank accession numbers were used for all genes with the exception of 

Arabidopsis thaliana and Oryza sativa for which the Gene ID number from the SALK database were used, 
v . J 



templates to use for building the GTS1 structure, Swiss- 
Model server was used, finding high scores (64, 61, 63, 60, 
and 69) and very low E-values (L5E-37, 2.1E-30, 4.9E-29, 
2E-10, and 1.8E-43) for the templates 2h9l, (3iz6), 3ow8, 
2gnq, and ltbg respectively. 

The same workflow was followed to obtain the best 
crystal templates in order 3D -structural protein model 
for the GTS1 ribosomal interacting partners. The search 
in the Protein Data Bank (PDB) for the protein L19e and 
Nop 16 yielded the crystal structures of 3iz5, 3jyw, and 
3u5e for L19e, and 2aje, lwOt, 2juh, 2ckx, and 2roh for 
Nop 16, showing comparable values in identity and suit- 
ability by BioInforMank Metaserver and Swiss -Model ser- 
ver analysis. 



Table 2 Tissue specific coexpressed clustering genes with 
GTS1 using GENEVESTIGATOR [24], and cross-verified in 
atted-ll co-expression analysis (http://atted.jp/) revealing 
a strong specific correlative expression with ribosomal 
proteins (see in bold) 



Gene locus 


Function 


AT2G47790 


Transducin/WD40 repeat-like superfamily protein 


(GTS1) 




At3g 13460 


Evolutionarily conserved C-terminal region 2 


At5g04290 


Kow domain-containing transcription factor 1 


At1g43700 


VIRE2-interacting protein 1 


At1g01770 


Protein of unknown function DUF1446 




(lnterPro:IPR010839) 


At5g43720 


Protein of unknown function (DUF2361) 


At 1g 74040 


2-i so propyl ma I ate synthase 1 


At2g32850 


Protein kinase superfamily protein 


At2g22090 


RNA-binding (RRM/RBD/RNP motifs) family protein 


At3g52120 


SWAP (Suppressor-of-White-APricot)/surp 




domain-containing protein/D1 1 1/G-patch 




domain-containing protein 


At3g 16780 


Ribosomal protein L19e family protein 


At2g48120 


Pale cress protein (PAC) 


At2g21580 


Ribosomal protein S25 family protein 


At1g34180 


NAC domain containing protein 16 


At2g27880 


Argonaute family protein 


At1g62990 


KNO^ED-like homeobox of Arabidopsis thaliana 7 


At1g 16430 


Surfeit locus protein 5 subunit 22 of 




mediator complex 



Quality of threading models 

To assess the quality of the protein models l-TASSER 
and Procheck analysis were performed. The l-TASSER 
analysis gave the accuracy parameters such as a C-score 
of -0.9, 0.60 ±0.14 TM-score with 1848 decoys and 
0.1467 of cluster density for GTS1, while Procheck ana- 
lysis revealed that the main chain conformations of 
GTS1 protein model were located in the acceptable re- 
gions of the Ramachandran plot. A majority of residues 
(77.6%) were in the most favorable regions, whereas 
14.5% of the residues were placed in the allowed regions, 
and 6.5% were in generously allowed regions. Only 1.4% 
of the residues were present in the disallowed regions, 
respectively. The plot of xl versus x2 torsion angles for 
each residue showed that most of the ro tamers in GTS1 
model was localized in low energy regions, iii) The ProSa 
analysis gave Z-scores of -5.71 for GTS1. The scores 
were within the range usually found for native proteins 
of similar size, i.e., -7.31, -4.02, -6.63, -7.93, and -7.33 for 
the templates 2h9l, 3iz6, 3ow8, 2gnq, and ltbg crystal 
structures, respectively, iv) QMEAN analysis of GTS1 
model revealed Q-values of 0.67. A quality factor of 
0.793, 0.315, 0.71, 0.786, and 0.849 was estimated for the 
crystal structures of the templates 2h9l, 3iz6, 3ow8, 
2gnq, and ltbg, respectively, indicating that the GTS1 
model is within the range of accuracy of the templates 
crystallographic structures, v) Root mean square devi- 
ation (RMSD) between GTS1 model and the crystal 
templates Ca backbones of the closed templates were 
2.408 A for 2gnq and 3.192 A for 2h9l. 

All of the above parameters were also determined for 
L19e and Nop 16 protein models resulting in comparable 
structural quality values of our modeled L19e and Nopl6 
proteins. 

3D structure of Arabidopsis GTS1 

We obtained the best structural models of this newly de- 
scribed Arabidopsis WD40 repeat protein, GTS1, based 
on homology modeling (Figure 5). The 3D structure of 
Arabidopsis GTS1 belongs to the transducin/WD40 re- 
peat protein family because it shares all the structural 
characteristics of WD40 proteins (Figure 5A), which 
agree with the general crystal structure of 2h9l or 2gnq 
template. In general, the structure can be visualized as a 
short, open cylinder where the strands form the walls. 
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Figure 2 Physical map of G7S7 knockout gene and phenotypic characterization of gtsi mutant. (A) The GTSI gene with the positions of exons 
(numbered black rectangles) and introns (thick lines) are represented. The 5' and 3' untranslated regions are depicted in white rectangles. The location 
of the gtsi T-DNA insertion is shown using an inverted black triangle. The names and locations of primers used for RT-PCR experiments are also indicated. 
Bar = 0. 5 kb. (B) The T-DNA insertion causes a knockout expression of the gene. The quality of the RNA and the loading control was assayed by 
monitoring ACTIN gene expression. (C-F) GTS1 negatively controls seed germination, gtsi mutant germinated faster at 1 and 3 days after incubation in 
water (D, F) than the wild type (C, E). (G-H) GTS1 controls biomass accumulation and growth development in Arabidopsis. (H), Growth rate of gtsi is 
faster than that of WT (G) at 1 5 DAG. gtsi shows larger leaf area (biomass) (H) than WT (G). DAG = Days after germination. 



At least four repeats are required to form a p-propeller. same (3-propeller (Figure 5A), potentially reinforcing the 
In our case, GTSI contains 7 WDs, where the final and structure. Despite the low amino acid sequence identity 
last (i.e., the N- and C-terminal) WDs participate in the across species, a relatively good conservation of the overall 
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A 



WT 




Figure 3 Mutation in GTSl gene promotes early flowering, 
growth development and biomass accumulation. A) a faster 
growth of gtsl mutant compared to WT is depicted with gts1 
displaying a taller phenotype than WT. B) gtsl mutant flowers earlier 
than WT as depicted by a reduced number of gtsl rosette leaves 
compared to WT at bolting time. Gtsl mutant accumulates higher 
cell biomass than WT as shown by a bigger overall gtsl rosette leaf 
area compared to WT (B). 



fold (Ca carbon chain) of this protein is found among plant 
species and eukaryotes in general [4,28]. 

Surface electrostatic potential analysis (Figure 5B) re- 
veals several prominent positively charged residues (blue 
regions), predominantly in the walls of the cylinder 



(tunnel) and C-terminal arm. The environment of this 
protein is essentially negatively charged (red regions) 
(Figure 5B), as highlighted by the Poisson - Boltzmann 
electrostatic potential. By assigning a value of +1 to basic 
residues (R, K) and -1 to the acidic residues (D, E), net 
charge of protein was calculated to be -22 for GTSl. The 
central tunnel of the GTSl structure exhibited a predom- 
inantly positive charge in the top view. Comparison be- 
tween GTSl and other WD40 repeat proteins, such as 
templates 2gnq and 2h9l did not exhibited large differ- 
ences in the general topology as it was further confirmed 
by the RMSD value of 2.408 A and 3.192 A, respectively, 
whereas significant differences were found in particular 
regions of the proteins such as the N-terminal, and C- 
terminal regions. 

Conservational and functional/ligand-binding site analysis 

The conservational and ligand-binding or functional fea- 
tures of GTSl were analyzed (Figure 6). Surfaces of Ara- 
bidopsis GTSl (rotated 180°) showing the conservation 
index of residues are depicted in Figure 6A. Consurf 
conservation analysis showed that GTSl cylinder-like 
p-propeller structure is quite well conserved, especially the 
residues located in the core of the structure (Figure 6A). 
The N- and C-terminals of the protein are other most 
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Figure 4 Correlation of expression pattern between GTSl and the ribosomal protein. Samples whose contribution is more than 1.0 are 
outputted. The probe pair giving highest correlation is selected from all combination between the probes for the two loci, 266466_at (GTS1) and 
25841 0_at (L19e) respectively. Sample contribution score is calculated as a product of z-scored expression values. The average of the score is the 
pearsons correlation coefficient. 
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Top view 



Bottom -3^. 



f] WD40 

pSheet^ 






Bottom view 



Side view 



Bottom view 



Figure 5 Structure of GTS1, a WD40 repeat protein. A) Top, bottom and side views of the seven-bladed (3-propeller structure (most stable 
form) made by using PyMol software (http://www.pymol.org/), with the N-terminal and C-terminal regions in blue and red color, respectively. A 
depicted model is included to show the basic WD40 (3-sheet structures conformation of the cylinder structure with a tunnel-like structure in the 
centre that communicate both top and bottom sides. B) 180° rotated views of the electrostatic potential representation on the GTS1 protein 
surface. The surface colors are clamped at red (-10) or blue (+10). 



conserved regions of the protein, besides the core area 
which has a major role in maintenance of the protein 
structure [9,29]. This distribution of the core conserved re- 
gion and surface variable residues helps maintain a similar 
overall fold among WD40 repeat proteins, but also pro- 
duces differences observed in terms of a multiple protein 
interactions along WD40 repeat protein, where it is indi- 
cated by a discontinued red line for both ribosomal pro- 
teins in the GTS1 structure (Figure 6A). 

Figure 2B shows a general view of the amino acids in the 
tunnel-like structure holding up the interacting peptides. 
GTS1 exhibit the putative active (most representative 
ligand-binding) site located in the center of the structure 
(Figure 6B, top view), which normally is another interacting 
polypeptide, containing several conserved but also few vari- 
able amino acids. A detailed view, showing the spatial distri- 
bution of residues responsible for the conformation of the 
ligand binding domain surrounding the ligand and directly 
implicated in this interaction are Y43, V44, F45, S61, N87, 
S107, F134, V194, S267, R32% and the peptide chain 
substrate bound to the tunnel-like structure (Figure 6B, 
detailed view at the right side). Conservational analysis of 
WD40 repeat proteins with significant close identity to Ara- 
bidopsis GTS1 among other species returned a large num- 
ber of highly variable (bold) and small number of conserved 
(italic) residues as written above. Conformational predic- 
tions indicate that the peptide (ligand) is projected to and 
partially located inside the tunnel-like structure of GTS1. 



The conservational and functional analysis of the 
ribosomal L19e and Nop 16 proteins are depicted in the 
Figure 7. Consurf conservation analysis in the Figure 6A 
showed that Nopl6 protein is well conserved, espe- 
cially in the interacting surface region (red arrows, and 
Figure 7B) and the nucleic acids (blue arrow). Only few 
light and deep blue residues are around the surface of the 
protein. Ligand-binding analysis of this protein showed 
in Figure 7B highlights the area of the Nop 16 protein 
where the interaction with nucleic acids takes place. This 
area is located in a cleft integrated by the N-terminal 
a-helix. The residues implicated in the interaction between 
the protein and the nucleic acid are L40, M41, T142> and 
R146, resulting in four not well conserved amino acids 
(italic). 

Consurf conservation analysis showed in the Figure 7C 
highlights an equally distributed number of conserved 
and variable residues along the surface of L19e protein. 
The area of interaction with GTS1 seems not to be as well 
conserved as its counterpart Nop 16. Few variable residues 
(blue color) are located in the area of interaction 
with the N-terminal tail of GTS1 (red arrows in Figure 7C). 
In addition, among the surface directly implicated in 
the interaction with the ribonucleic acid (blue arrows in 
Figures. 7C, D), S3, K5, 76, R9, L10, N36 seems to be well 
conserved, since only one residue, 16, was found to be vari- 
able (italic), and the rest exhibit an average index of conser- 
vation or highly conserved like R9 (bold). 
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Figure 6 Conservational and ligand binding domain analysis of GTS1, a WD40 repeat protein. A) Consurf-conservational analysis of GTS1 
protein showed in two individual views rotated 180°. The conserved and variable residues are presented as space-filled models and colored 
according to the conservation scores. The interacting area of the protein with ribosomal counterparts has been highlighted by a red discontinue 
line. B) Detailed view of the ligand-binding area of GTS1 with a peptide and the spatial distribution of the interacting residues in a detailed view. 



GTS1 interaction mechanism: molecular docking analysis 
with ribosomal counterparts 

In order to get insights into the GTS1 regulatory/ multi- 
interacting mode with other proteins, we analyzed the 
conformational interaction between GTS1 and two 
interacting partners involved in the structure and bio- 
genesis of ribosomes in Arabidopsis. This analysis was 
carried out by molecular docking, using newly modeled 
structures of the two ribosomal proteins. 

Figure 8 shows the mode of interaction between 
Arabidopsis GTS1 and the nucleolar protein 16 (Nopl6), 
involved in the biogenesis of 60S ribosomal subunit. The 
binding mechanism occurs through the formation of a 
stoichiometry complex 1:1 between both proteins. A de- 
tailed view of this interaction is depicted in the magnified 
views displayed in Figure 8A, where C-terminal arm of 
GTS1 is located inside the cavity made by N-terminal a- 
helix of Nop 16 and neighboring helices, in addition to the 
direct interaction of this N-t a-helix with the (3-propeller 
4 structure of GTS1. The N-t a-helix of GTS1 is covered 
completely by the Nop 16 cleft, therefore preventing access 
by other interacting partners to this area. On the other 
hand, it also partially impedes the interactions that are ne- 
cessary for binding the nucleic acid to Nop 16 by stereologi- 
cal impediment. The interaction is non-covalent, thus it 



may be reversible by increasing salt concentration and/or 
pH to alkaline conditions. Figure 8B shows the large area 
of interaction covered by the Nop 16 molecule in the GTS1 
in a perpendicular (90°) view. 

With regard to energy, the electrostatic potential ana- 
lysis for the contact surface of both structures exhibited 
a highly compatible fingerprint distribution of oppos- 
ite charges, i.e., positive (in the contact area of Nop 16 
protein), and negative (mainly for GTS1) (Figure 8C). 
There are not large areas with hydrophilic character in 
the contact surface between both proteins, and the 
formation of the complex may be mediated by a high 
number of direct and water-mediated H-bonds. 

Figure 9 shows the mode of interaction between Ara- 
bidopsis GTS1 and the 60S ribosomal protein L19e. The 
binding mechanism also occurs through the formation 
of a stoichiometry complex 1:1 between both proteins. A 
detailed view of this interaction is depicted in the magni- 
fied view (Figure 9A), where the N-terminal area of 
GTS1 hugs the thin structure of L19e, and the neighbor- 
ing a-helix contacts the p-propeller 4 of GTS 1. Addition- 
ally, the N-terminal area of the L19e protein directly 
interacts with the residues of the central bottom side of 
the GTS1 tunnel, leaving the central top side of the tunnel 
free as detailed in Figure 8B. The same area of L19e 
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Figure 7 Conservational and ligand binding domain analysis of ribosomal Nop16 and L19e proteins. A) Consurf-conservational analysis of 
Nop16 protein showed in two individual views rotated 45°. The conserved and variable residues are presented as space-filled models and colored 
according to the conservation scores. The interacting areas of the protein with GTS1 protein and ribonucleic acid have been highlighted in red and 
blue color, respectively. B) Detailed views of the ligand-binding area of Nop16 protein with a chain of ribonucleic acid and the spatial distribution of 
the interacting residues depicted in both detailed views. C) Consurf-conservational analysis of L19e protein showed in two individual views rotated 45°. 
The conserved and variable residues are presented as space-filled models and colored according to the conservation scores. The interacting areas of 
the protein with GTS1 protein and ribonucleic acid have been highlighted in red and blue color, respectively. D) Detailed views of the ligand-binding 
area of L19e protein with ribonucleic acid and the spatial distribution of the interacting residues depicted in both detailed views. 



protein is also involved in the interaction with ribonucleic 
acid (Figure 7D). The electrostatic potential analysis for 
the contact surfaces of both structures exhibited a highly 
compatible fingerprint of opposite charges (positive in the 
contact area of L19e protein and negative in the contact 
surface of GTS1) (Figure 9C). 

Discussion 

GIGANTUS1 is here described to be very important in 
regulating plant growth development (seed germination, 
faster growth, flowering time, and biomass accumulation) 
(Figures 1, 2 and 3). This is the first time that a mutation 
in GTS1 has been implicated in early germination, growth 
and development in Arabidopsis thaliana. The molecular 
mechanism by which GIGANTUS1 regulates plant growth 



development is still unknown. As a member of WD40 
protein family, GTS1 is expected to play central roles 
in different biological processes including cell division 
and cytokinesis, flowering, floral development, cyto- 
skeleton dynamics, nuclear export to RNA processing, 
transcriptional mechanism, and protein-protein interac- 
tions [4]. We postulate that GIGANTUS1 might primar- 
ily function as a site for protein-protein interaction 
or mediator of transient interplay among other proteins 
to regulate different biological processes in plants. 
The development of protein complexes involves regu- 
latory interactions that are mainly controlled by scaf- 
folding proteins, such as WD40 repeat motifs. These 
motifs are important features of diverse protein-protein 
interactions [4], providing an unbending platform for 
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Top rim 




Figure 8 Analysis of the interaction between GTS1, a WD40 repeat and Nop16 proteins. A) The complex between GTS1 (blue surface and 
cartoon representation) and Nop16 (white/gray surface and cartoon representation) from the top view. B) Surface/cartoon structure rotated 90° in 
blue (GTS1) and white/gray (Nop16) are depicted, and highlight the large interacting surface between both proteins. C) Electrostatic potential 
depicted in both interacting partners, where has been highlighted both areas involved in the interaction by light-blue discontinue arrows. The 
surface colors are clamped at red (-10) or blue (+10). 



interactions of proteins with other cellular compo- 
nents and controlling therefore several vital functions of 
the cell, such as signaling cascades, cellular transport and 
apoptosis [29-31]. 

The WD40 domains in GTS1 protein are shown to 
contain seven or multiples of seven repeats forming 
a highly stable (3-propeller structure (Figure 5A). The 
7-fold p-propeller is the most stable (3-sheet geometry 



characterizing the resolved WD40 structures and also 
used to identify WD40 proteins [32]. However members 
of this protein family have been also found to contain 
as high as sixteen repeats [33]. Proteins with less than 7 
repeats form an incomplete (3-propeller structure and 
require additional WD -repeats from their neighbors to 
stabilize themselves, making dimers [34]. There is no ap- 
parent folding order for each repeat and the order in 
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Figure 9 Analysis of the interaction between GTS1, a WD40 repeat and L19e proteins. A) The complex between GTS1 (green surface and 
cartoon representation) and L19e (white/gray cartoon representation) from a lateral view. B) Surface/cartoon structure rotated 45° in green (GTS1) 
and white/gray (L19e) are depicted, and highlight the two interacting surfaces between both proteins. C) Electrostatic potential represented in 
both interacting partners, where has been highlighted both areas involved in the interaction by black discontinue arrows. A detailed view of this 
interaction has been depicted. The surface colors are clamped at red (-10) or blue (+10). 



which repeats fold might vary among different WD40 
proteins, or even within the same protein [35]. These 
proteins are known be involved in light signaling/photo- 
morphogenesis and flowering [36], auxin response and 
cell division [37], in meristem maintenance [38], floral 
development [39], seed development and flowering [40], 
chromatin-based gene silencing and organogenesis [39], 
protein turn-over, microtubule dynamics, phospholipid 



binding and vesicle coating [4]. This justifies the great 
deal of research interest in the WD40 protein superfamily 
across plant species. Our data (Figures 2 and 3) suggest 
that GTS1 belongs to the WD40 protein subfamily regu- 
lating auxin response and cell division [37], meristem 
maintenance [38], floral development [39], seed develop- 
ment and flowering [40]. In this study the Arabidopsis 
GTS1 ligand-binding domain (main functional domain) 
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lies mainly on the top surface residues, which integrates 
parts of p-propeller domain (Figure 6B). However, our 
data revealed that GTS1 WD40 propellers have three dis- 
tinct surfaces available for interactions: the top region 
of the propeller, the bottom region (Figure 9), and the 
circumference [4,5] (Figures 8 and 9), suggesting the 
multi-functional properties of GTS1 protein through 
protein-protein interactions. Indeed, protein-protein and 
protein-peptide interactions involved the entry site of the 
central channel of the p-propeller (Figure 6B), where the 
majority of interaction partners (including small mole- 
cules) bind [5]. N- or C-terminal extensions of GTS1 run 
parallel to the tunnel-like structure, which form the 
complete 7 WD40 repeat domains, making them access- 
ible for interaction with other partners (Figures 8 and 9). 
WD40 domains can thus act as large interaction plat- 
forms for multiple protein interactions. In comparison 
to other domains, the proteins containing WD40 motif 
are components of several interaction pairs [41,42], and 
act as scaffolds for larger complex assemblies. This 
work represents the first time the 3D-structural and 
molecular features of the GTS1 protein in plants have 
been examined. 

To better understand what area of the GTS1 plays an 
important role in complex formation for functional con- 
servation across plant species, we identified two funda- 
mentally conserved regions; the first, located on the top 
rim, constituted by the residues composing blades 4 to 
7, including N- and C-terminal arms; and the second 
large conserved surface is located on the bottom of the 
propeller and is mainly composed of blades 1, 2 and 3 
(Figure 6A). These regions represent potential protein- 
protein interaction sites [43]. 

In general, plant and particularly Arabidopsis-WD 
repeat proteins are strongly conserved. Most of these 
proteins are components of basic cellular machinery 
regulating plant-specific processes. An interesting ques- 
tion arises as to how these proteins evolved into their 
specific cellular roles. One of the key functional processes 
of WD proteins is the biogenesis of eukaryotic ribosomes, 
a highly regulated and dynamic process that begins in 
the nucleolus with transcription of rRNA precursor 
(pre-rRNA) and rapidly packaged into the 90S ribo- 
nucleoprotein particle containing ribosomal proteins, 
non-ribosomal proteins, and snoRNA-containing ribonu- 
cleoprotein particles (snoRNPs). The 90S pre-RNPs are 
converted into 43S and 66S ribosome assembly interme- 
diates, which ultimately give rise to mature 40S and 60S 
ribosomal subunits [44], It is well known that ribosome 
biogenesis is driven by a large number of pre-ribosomal 
factors that associate with and/or dissociate from the 
pre-ribosomal particles along the maturation pathway. 
Although there has been much progress to identify ribo- 
some assembly intermediates and their protein and RNA 



constituents [45], the information about the architecture 
of these pre-rRNPs is scarce. It is unclear which proteins 
are the nearest neighbors within the assembled ribo- 
somes and to what extent neighboring molecules func- 
tion together. 

WD40 protein-protein interaction motifs represent 
excellent candidates to mediate interactions in the multi- 
protein subcomplex comprising a neighborhood in as- 
sembling ribosomes because of their protein-protein 
multi-interacting versatility. More than 70 trans-acting 
factors required for ribosome assembly have been identi- 
fied [46], as well as 80 additional assembly factors present 
in pre-ribosomes [47]. Therefore, such WD40-containing 
proteins may nucleate the assembly of pre-ribosomes by 
interacting sequentially or simultaneously with other as- 
sembly factors or ribosomal proteins. Among the assem- 
bly factors, 17 proteins were found to contain WD40 
motifs [48]. Many of the annotated ribosome biogenesis- 
WD40 repeat proteins were shown to directly interact 
with, or regulate the levels of other proteins [49] or to be 
components of multiprotein subcomplexes. Yeast WD40 
protein Ytml is a constituent of 66S pre-rRNPs, whose 
depletion resulted in a deficiency of 60S ribosomal sub- 
units [50]. Its homologue, mammalian WDR12 functions 
in the maturation of the 60S ribosomal subunit. WDR12 
forms a stable complex with a novel member the nucle- 
olar proteins Pesl and Bopl (Pe- BoW complex), which 
are crucial for processing of the 32S precursor ribosomal 
RNA (rRNA) and cell proliferation [36]. Interestingly, a 
potential homologous complex of Pesl-Bopl-WDR12 
in yeast (Nop7p-Erblp-Ytmlp) is involved in the control 
of ribosome biogenesis and S phase entry [51]. 

The yeast WD40 repeat protein Makll that modulates 
a p21 -activated protein kinase function is an essential 
factor in nuclear maturation of 60S ribosomal subunits 
and its depletion led to a cell cycle delay in Gl, indicating 
an early step nucleolar role of Makll in ribosome assem- 
bly. Another sub-complex, transiently associated with late, 
nuclear pre-60S precursors, is composed of four proteins 
and contains Ipi3 as a WD40 repeat member [52]. 

In this study, a new interacting counterpart, the Arabi- 
dopsis Nopl6 protein was identified as a potential ribosome 
biogenesis factor in plants, which could be implicated in 
formation of the 60S ribosomal precursor. This process 
may be regulated by an interaction with GTS1 (Figure 8). 
This interaction was studied using a docking analysis that 
showed a stable interaction between GTS1 and Nop 16, in- 
volving the N-terminal tail and the 4 th blade of the first 
partner, and a cleft formed in the second ribosomal factor 
by the N-terminal a-helix and the neighboring secondary 
elements. Another ribosomal protein (L19e protein) was 
found to interact with GTS1. L19e protein is implicated in 
the structural stability of ribosome. The interacting area be- 
tween GTS1 and L19e is very close to that of Nopl6 and 
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pattern with GTS1 (see Table 2 for detail description). Data analysis was 
retrieved from Genevestigator [24]. 

Additional file 2: Figure S2. The plant GTS1 proteins are most similar 
to the animal Wdr89 protein. ClustalW alignment of the plant GTS1 
proteins from Arabidopsis tha liana (AtGTSI), Oryza sativa (OsGTSI), and 
lea mays (ZmGTSI), with the animal Wdr89 proteins from Homo sapiens 
(HsWdr89), Ratus norvegicus (RnWdr89), and Mus musculus (MmWdr89) 
show several conserved residues across the entire length of the protein 
(gray shaded residues). Oryza sativa (OsGTSI) and lea mays (ZmGTSI) 
share 53.8% and 52.7% identity with AtGTSI protein, respectively. WD40 
repeat domains are underlined for all three plant GTS1 proteins. 



GTS1 interacting area (Figure 8). This suggests that the in- 
teracting mechanism of regulating the biogenesis nucleolar 
factor Nopl6 and the structural ribosome factor L19e may 
be competitive. Therefore both steps in the 60S ribosomal 
subunit formation, structural maturation and stabilization 
may be separate in the time and/or different cellular 
compartments. 

In support of our data are two other examples of 
WD40 repeat ribosome biogenesis factors, Rrbl and 
Sqtl, which interact directly with ribosomal proteins for 
60S ribosomal subunit assembly. Rrbl interacts with the 
ribosomal protein Rpl3 in the nucleus and regulates its 
levels [53], and Sqtl interacts with RpUO in the cyto- 
plasm [54]. Both proteins have a role in the association 
of the corresponding ribosomal protein with the nascent 
60S ribosomal subunits and might regulate the levels of 
the corresponding ribosomal protein. Other WD40 re- 
peat proteins have been implicated in the formation and 
stabilization of the small ribosomal subunit 40S. Yeast 
RACK1 regulates the translation initiation by recruiting 
PKC to the ribosome [55,56]. Four RACK1 orthologs 
identified in Arabidopsis thaliana may have a similar 
activity [57]. These interactions could provide a mechan- 
ism to regulate translation activities of ribosome popula- 
tions programmed with specific mRNAs [58]. 

Conclusions 

The present study provides substantial evidence for the 
role of GIGANTUS1 in controlling seed germination, 
faster growth and biomass accumulation in plants. The 
gene is mainly expressed in meristemic regions and is 
therefore important in cell division. It is postulated to 
regulate growth development through diverse protein- 
protein interactions, including those involved in scaf- 
folding and dynamic multi-subunit complexes such as 
the ribosomal protein biogenesis, stability and activity. 
Given its rich interaction surfaces, GTS1 functions prob- 
ably as an adaptor in many different protein com- 
plexes or protein-DNA complexes in very diverse cellular 
processes that needs further research investigation. Our 
modeling data suggests that GTS1 mediates molecular rec- 
ognition events mainly through the smaller top surface of 
domain, which comprises three residues forming a transi- 
ent complex with other peptides. It would be interesting to 
further investigate GIGANTUS1 knockout genes in agro- 
nomically important crops with the aim of improving crop 
yield and biomass accumulation for sustainable plant- 
based biofuel production. 
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Additional file 1: Figure SI. Hierarchical clustering microarray 
expression analysis of GTS1 and other selected tissue specific genes. 
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